# ImageNet Optimization
Hiera Base 224 In1k Hf
Hiera is a hierarchical vision Transformer that is fast, powerful, and simple. It outperforms prior state-of-the-art models across a wide range of image and video tasks while running significantly faster.
Image Classification
Transformers English

facebook
188
2
Tecoa2 Clip
MIT
A vision-language model initialized from OpenAI CLIP and adversarially fine-tuned on ImageNet for improved robustness.
Text-to-Image
chs20
53
1
Fare4 Clip
MIT
A vision-language model initialized from OpenAI CLIP, with robustness improved through unsupervised adversarial fine-tuning.
Text-to-Image
chs20
45
1
Vit Hybrid Base Bit 384
Apache-2.0
The hybrid Vision Transformer (ViT) combines a convolutional network with a Transformer architecture for image classification on ImageNet.
Image Classification
Transformers

google
992.28k
6
Convnext Large 224
Apache-2.0
ConvNeXT is a pure convolutional model inspired by vision Transformers, trained on the ImageNet-1k dataset at 224x224 resolution.
Image Classification
Transformers

facebook
740
27
Convnext Base 224
Apache-2.0
ConvNeXT is a pure convolutional model inspired by vision Transformers, trained on the ImageNet-1k dataset for image classification tasks.
Image Classification
Transformers

facebook
2,756
9
Convnext Small 224
Apache-2.0
ConvNeXT is a pure convolutional model inspired by vision Transformers, trained on the ImageNet-1k dataset and reported to outperform them.
Image Classification
Transformers

facebook
586
5